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Abstract: Various kinematical quantities associated with the statistical properties of dynami- 
cal systems are examined: statistics of the motion, dynamical bases and Lyapunov exponents. 
Markov partitons for chaotic systems, without any attempt at describing "optimal results" . The 
nI I Ruelle principle is illustrated via its relation with the theory of gases. An example of an appli- 

cation predicts the results of an experiment along the lines of Evans, Cohen, Morriss ' work on 
viscosity fluctuations. A sequence of mathematically oriented problems discusses the details of 
the main abstract ergodic theorems guiding to a proof of Oseledec 's theorem for the Lyapunov 
exponents and products of random matrices. 
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§_Z Dynamical systems and their statistics. 

>; 

OO I A dynamical system (C, S) will consist of a piecewise smooth compact manifold and of a 

piecewise difFeomorphic map S of C into itself with a piecewise diffeomorphic inverse S~^.^ 



o 

O ■ 

, The set J\ of the singularities of either S or of the manifold C will be called the set of the 

singularities of (C, S*).^ 

■ The reader disturbed by the above generality can simply think that all I am discussing is the 

case of a C°° diffeomorphism of a C°° compact manifold without boundary. The concession 
to the generality is due to the fact that some of the most important dynamical systems really 
show physically significant singularities, like the billiards or the hard sphere gases. 



O I Example 1: A Hamiltonian system with I degrees of freedom is observed at the instants in which 



a certain event, the timing event, happens. This is the event in which the point representing 
^ , the system passes through a predefined surface in phase space {e.g. one particle of the system 

passes through an ideal wall in physical space or the distance between two particles takes some 
prefixed value r). If C is the family of timing events with a prefixed energy Hq, and S is the 
r> ' transformation mapping one event in C into the following, then (C, S) is a dynamical system. 



■ Dipartimento di Fisica, 7° Universita di Roma, P.le Moro 2, 00185, Roma, Italia. This paper is archived in 
mp_arc<S^math.utexas .edu, #94-333. 

^ By piecewise smooth manifold I mean a manifold that can be regarded as the union of a finite number of C°° 
compact manifolds, with any two of them having in common only boundary points: the union of the boundaries 
of such manifolds will be called the set of singularities of C. Similarly a map of C into itself is said to be piecewise 
smooth if C can be regarded as the union of finitely many compact C°° manifolds and S is a diffeomorphism 
as a map of the interior of each such manifold and its image. A compact C°° manifold, of dimension d, is a 
C°° manifold with boundary consisting of finitely many compact C°° manifolds of dimension < d with only 
boundary points in common; the inductive definition is started by declaring that a point is a C°° manifold of 
dimension 0. 

Sometimes one may wish to consider as singular points some points where S is in fact regular: in this case such 
points will also be supposed to lye on piecewise smoooth submanifolds of C (of lower dimension) and will be 
included in the set of singularities. Furthermore, since the action of S on the singular points will not be studied, 
one may wish to require that S^^ be undefined or arbitrarily defined on the points x 9 S{C/Af) so that in such 
points the property S{S~^x) = x may even fail. And likewise one may avoid defining S on Af. I will not try to 
be so general: see [P]. 
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The probability distribution fiL defined by assigning to a (measurable) set of events E C C the 
Liouville measure of the set of phase space points on the energy surface experiencing the first 
timing event in E, is an invariant measure {i.e. Hl{E) = ijll{S^^E)). The hl will still be called 
the Liouville measure. 

Example 2: consider a box, [—^L, \L\^ , with side L and periodic boimdary conditions (i.e. op- 
posite sides identified) and with a few circular regions Cj, "obstacle of radius" rj, in it. A 
particle moves freely among them and collides elastically, at velocity v. The space C consists 
of the collisions^ parameterized by the point a on the obstacle where the collision takes place 
and by the angle formed between the incoming velocity and the outer normal to the collision 
point: hence ^ e [— I, |]. We see that C consists in the union of dCi x [—^^ §]• A point in C is 
determined by 2 coordinates [a, ip), with a € [0,27r] being the angular position of the collision 
point and a label i denoting the obstacle on which the collision takes place. The obstacles will 
be supposed so distributed so that no coUisionless trajectory is possible. The map S is smooth 
everywhere except on the boundary of C {i.e. (f = ±7r) and on the collisions x = {i, a, ip) such 
that Sx is a tangent collision. This is the billiard system. A natural metric on C is da'^ + dip"^. 
The system being hamiltonian it is easy to find the Liouville distribution: in the coordinates 
(a, ip) it is given by Trida cos ip dip where (a, (/?) represent collisions with the obstacle i and F 
is defined so that Hl{C) = 1. 

Example 3: many particles, say N ^ in a periodic box as in the example 2. The particles 
interact with a radial pair potential v (with range smaller than L) and with an external potential 
represented by the hard core due to the obstacles. The phase space is and the equations of 
motion are: 

1 = —P., P. = F +Ei- ap (1.1) 

where i is the 1-axis unit vector, E is an external constant field, F_j is the force generated by 
the total potential $ on the j-th particle; the obstacles are taken into account by the elastic 
reflection rule and a is so defined that the energy H = J2j l^P] + ^ is a constant of motion 
{gaussian thermostat): i.e. a = Ei- p ./ p"^. /2m, as a simple calculation proves; m is the 
particles mass. The phase space C consists of the energy Hq configurations in which one particle 
(any one) is colliding with some obstacle. The time evolution S maps one such configuration 
into the following configuration of the same type ("next collision configuration"). The phase 
space thus defined is a 4N — 2 dimensional subspacc of the total 47V dimensional phase space; 
and a simple calculation of the divergence of the r.h.s. of (1.1) shows us that the phase space 
volume changes at a rate {2N — l)a in the total 4 AT dimensional phase space J^. This example 
has a rather involved set of singularities: nevertheless it is pieccwisc smooth and it defines a 
dynamical system in the above sense. Physically it is a model for electric conductivity {Lorentz 
gas conductor). 

Example The map of the torus C = [0,27r]^ defined by: S{ipi,ip2) = {fi + <P2,Vi + 
2(^2) mod 2n. One also writes: 

Then S preserves the distribution iio{dip) = d(^/(27r)^. This system has no singularities. 

Given a dynamical system (C, S) we shall consider initial data chosen randomly with respect 
to a given probability distribution /iq. It is customary to consider the "Liouville distribution": 
the latter means a probability distribution with a positive density with respect to the volume 
on the phase space, as one often calls the space C. Sometimes it could literally coincide with the 
Liouville measure of analytical mc;chanics; e.g. when the systc^ni [C.S) is obtained by timing 
the evolution of a hamiltonian system: this is the case in examples 1,2. 
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The qualitative theory of (C, S) studies the asymptotic properties of the motions following 
initial data chosen randomly with distribution fiQ, which wc call "/io ^random data". 

More generally one could investigate the properties enjoyed by motions with initial data cho- 
sen randomly with respect to other probability distributions /i not absolutely continuous with 
respect to the volume measure. Sometimes this is regarded as a less interesting question on the 
grounds that the Liouville distribution is the natural one to use in selecting initial data. This 
is a preconception; in fact it is well known that it is equally easy (or, rather, difficult) to pro- 
duce random initial data with a distribution singular with respect to the uniform distribution: 
therefore such data are equally interesting; or at least one has to find better reasons to regard 
them less interesting. This being not the place to undertake a learned philosophical discussion 
on the preminence of the Liouville measure /Uq, I shall concentrate on the most studied question 
of which is the asymptotic behaviour of motions with initial data chosen randomly with distri- 
bution jLto- Many of the properties that will emerge will be relevant also for the other random 
choices with respect to less appreciated distributions. 

The key notion is that of statistics: 

1 Definition: Given a dynamical system (C, S) and a probability distribution ji attributing 

probability 1 to the set of points x which never in their evolution Jail on the singularity set Af 
(i.e. M(n^QS'~'°(C/A/')) = 1), we say that no has a statistics with respect to the evolution S if 
for any continuous function F on C, called observable, the time average of F on the motion 
generated by ^-almost all points x gC exists, and has the form: 



where JI is a suitable probability distribution on C, called the statistics of ji in the dynamical 
system {C,S). If ii =^ fiQ is the Liouville distribution thenJI will be called simply the statistics 
of {C,S) and we shall say that "(C,S) has a statistics" without reference to /Xq.^ 



(i) Usually the map S is such that S~''M is contained in a lower dimensional compact subman- 
ifold of C: therefore almost all points (randomly chosen with respect to the Liouville measure) 
will never fall on M and it will make sense to ask if the Liouville measure has a statistics. 

(ii) Note that JI does depend on /Uq. It makes no sense to talk about the statistics of a dynamical 
system without specifying the distribution with which the initial data are randomly selected. 

(iii) If {C,S) has a statistics {i.e. if the Liouville distribution has a statistics) then one says 
also that (C, S) has a unique attractor. It might happen, and in some interesting cases it does 
happen, that the space C can be represented as a union of several open sets Ui and of some 
/io-measure sets Nk'. for almost all points in Ui the average of F is given by a formula like (1.3) 
but with some Jl^ replacing JI. In this case one would say that (C, S) has several attractors each 
with its own statistics. In the present analysis I shall not consider such cases: often they can 
be reduced to cases with imiquc statistics simply by redefining the phase space to be Ui rather 
than C; although this is not the most general case. 

A further general notion is that of attractor. 

2 Definition: The set A C C is an "attractor" for ii-random data if /i has a statistics in the 
dynamical system (C, S) and (1): SA = A ("A is invariant"), (2): JI{A) = I (A has "probability 
1 with respect to the statistics), (3): the fractal (Hausdorff) dimension of A is minimal among 



Adhering, only for lack of time, to the prejudice that the volume measure is somewhat privileged. 
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(1.3) 



Remarks: 
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the A's with the properties (1),(2). 
Remarks: 

It is not convenient to require that the attractor be a closed set (as sometimes done). In many 

cases the attractors A in the above sense are dense in C but have a fractal dimension strictly 
less than that of C. So the above notion is subtler than the ones that imply or require that 
an attractor be closed. It also stresses that an attractor will usually have a non trivial fractal 
dimension. On the other hand the attractor in this sense is not unique {e.g. one can usually 
remove from it countably many points, except in special cases like when A is a fixed point or a 
periodic orbit, or even larger sets with zero Hausdorff dimension and zero Jl measure). 

With the above notions in mind we can proceed to define some of the main qualitative features 
of the motions following data chosen randomly with respect to a distribution /xq. 

%2 Dynamical bases and Lyapunov exponents. 

A given motion analysis can starts with the attempt at understanding the behaviour of nearby 
trajectories: this means understanding the linearization of the motions taking place near it. 
For this purpose the notion of regular point is necessary: 

3 Definition: A point x is regular if the map S is regular in the vicinity of S^x for all k > 0. 
Recalling that S is always supposed to be piecewise regular this simply means that S'^x does not 
fall into the set M of singularities of S: i.e. x ^ U^qS'~'=A/' or x e n^o6'~'=(C/A/'). 

Let S'^x be a trajectory starting at a regular point x, k = 0,1, . . .: the trajectory S'^{x + dx) 
with dx infinitesimal will stay close forever to that of x departing from it by an infinitesimal 
amount simply given by dS''{x)dx, where dS''{x) is the jacobian matrix of S'^ evaluated at the 
point X. It is a X matrix if d is the dimension of C; by the chain rule dS''{x) is : 

k 

Tk{x) = dS''{x) = dS{S''-'^x) dS{x) = Jl dS{S''-^x) (2.1) 

where the factors appear ordered from left to right as j increases. 

I shall assume that for some C,e > it is \ det dS{x)\ > e > and \dS{x)\ < C, for all regular 
points x. This assumption could be greatly weakened, see the appended problems and [P], but 
it is convenient continuing the discussion without having to worry about such matter. 

Therefore the square of the stretching of dx will be described by the ratio: 

{dS'^dx, dS^dx) 
[dx, dx) 

where the scalar products are evaluated in the metric defined on C. If G{x) is the metric tensor 
in x and if the dilatation matrix {Mk)ij is defined by setting: M^. = {dS''{x))* dS''{x), we see 
that in the local coordinates the matrix is expressed by: {Mk)ij = {dS'' (x))'^ G{S'' x){dS'' {x))ij . 
So that (2.2) can be explicitly written as: 

Y.ijiMk)ijdxidxj 
G{x)ijdxidxj 

Suppose that the symmetric matrix M^^*^ has a limit D+{x), as fc — > oo. Then tere is c such 

that c > I dct_D+(,T)| > e by the assumption, and we can define Zi(.'r) > l2{x) > . . . ld{x) > 
to be its ordered eigenvalues, counted according to multiplicity. Let Vi{x), . . . ,v^{x) be the 
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(2.2) 



(2.3) 



corresponding eigenvectors, which can be taken orthonormal with respect to the "naive" scalar 
product (a, 6) = J2i'^i^i (which is the scalar product in which Mk and £)+ are symmetric). 
Then MkVj{x) grows as lj{xY, in the sense that ^ log \Mky_j{x)\ ^^^> lj{x), as an elementary 
estimate shows. 

We also call li{x) >l2{x) > ... > 'ls{x)ix), respectively Ai(.t) > A2(.x) > . . . > Xs(x){x), the 
distinct eigenvalues (respectively their logarithms) of D+{x) and ni(x), . . . , ^^(^.-((.x) their multi- 
plicities, i.e. the dimensions of the eigenspaces Ui{x),. . . , Us(x){x) spanned by the eigenvectors 
with the same eigenvalue. The lj{x),\j{x) can be called the spectral or scaling coefficients at 
X and, respectively, the spectral or scaling exponents at x. 

Then it is clear that the vector space R'^ can be regarded as containing the planes Vi{x) = 
D V2{x) D Vz{x) D Vg(^x){x) where Vj{x) is the plane spanned by the eigenvectors of D+{x) 
with eigenvalue < lj{x): the dimensions of Vj{x) are dj = nj{x) + . . . + ns(x){x) and the planes 
Vj{x) are Vj{x) = Uj{x)® Us(a:){x). 

It is very important to realize that the vectors Vj do depend on the choice of coordinates ( and 
of the metric G): by changing the metric or the coordinates the vectors v_j may, in general, 
change. However the eigenvalues lj{x), and their multiplicities nj{x) cannot change, and also the 

sequence of decreasing subspaces Vj{x) cannot change, as one can sec by noting that the scaling 
propeties of the vectors in such spaces are not metric poperties, but have an intrinsic geometric 
meaning. They can be characterized by the property that ^ log \dS'''{x)u\ ^^_^^> Xj{x) if u e 
Vj{x)/Vj.^{x). 

One could try to define the planes with given contraction rate to be just Uj{x). But such 
spaces would not have an intrinsic dynamical meaning: because one could "tilt" Uj{x) sligthly 
and still keep the property that all the vectors contract at the rate \j (provided the tilting does 
not generate components along the f7j+p(a;) with p > 0, of course). 

Likewise one cannot assign a special meaning to the planes Ui{x) © ... © Uj{x): this would 
in fact generate a system of planes such that if m is a vector in the j-th plane and not in the 
(j — l)-th then the exponent of dilatation will be A^-; hut such system is not uniquely determined, 
for the same reasons discussed in the previous paragraph. On the other hand the system Vj [x) 
has the property of being uniquely determined by the action of S, and it defines a system of 
planes along which the scaling size becomes weaker. 

4 Definition: Given the dynamical system (C, S) a point x € C admits a "system of scaling 
(or contracting) planes" for the forward motion, or for S, if: 

(1) x is a regular point. 

(2) there exist numbers Xj{x) and positive integers s{x) and nj{x), with j = 1,. . . ,s{x), with 

the properties: 

(3) the space i?'^ of the infinitesimal vectors out ofx can be regarded as containing a sequence of 
s{x) subspaces Vi{x) = R''' D V2{x) D . . . D 14(x)(a;) D 14(x)+i = Q with Vj{x) having dimension 

^,5(2;) + . . . + nj{x) and 

(4) the following limits hold: 



lim - log \dS''{x)v\ = Xj{x) live Vj{x)/Vj+i{x) (2.4) 
fc— >+(x) k 

Then one says that the numbers Xj{x) are the scaling exponents of S at x in the forward 
direction, their exponentials lj{x) = e'^^'-^^ are the scaling coefEcients, the number nj{x) is 
the multiplicity of the j-th coefficient. Sometimes the coefficients are repeated according to 
the multiplicity: in this case their number is, of course, exactly the phase space dimension. 
The scaling exponents (coefficients) for S are often called the forward Lyapunov exponents 
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(coefficients). 



Remarks: 

(i) There is no reason why a point should admit a system of scahng planes. 

(ii) A sufBcient condition for the existence of contracting planes is that the limit of the sequence 
of matrices (95** (x)*c?S''' (x))^/^* exists and is a positive matrix D, see the discussion preceding 
the definition 4, 

(iii) If Ai(a;) > 0, Xs{x) < a-nd ^ji^) ^oy all j's one says that a; is a hyperbolic point. In 
this case if r_ is such that Xj > for j < r_ and Xj < for j > r_ the plane = Vr_ will be 
called the contraction, or stable, plane. The motion of a hyperbolic point is very unstable in, 

essentially, all possible senses. 

(iv) If a point x admits a system of scaling planes so do all the points on the trajectory generated 
by X. Such points do have the same coefficients (multiplicities included) and the spaces Vj{x) 
and Vj{Sx) are related by: Vj{Sx) = dS{x)Vj{x), for all j's. The spaces Uj{x) instead do 
not have, in general, any covariance property because they are not intrinsically defined by the 
dynamics (see above). 

(v) If X admits a system of scaling planes then it may not admit such a system in the dynamical 
system (C, S~^); and even if it does there is no reason why there should be any relation between 
the two systems or the relative exponents, c.f.r. problem (32). 

(vi) Note, once more, that the subspaces in which the contraction exponent is < A for some 
real A, intermediate between the exponents values, are well defined (and coincide with Vj{x) if 
Aj_i > A > Xj). But the ones in which the rate is > A are not well defined and this explains 
the impossibility of associating expanding planes to the forward motion. 

(vii) Note that the scaling planes for S' at a point, when existent, concern the forward motion: 
i.e. they are properties of the trajectory S^'x, k > 0. Likewise the scaling planes for concern 
only the backward motion. 

In view of the above remarks it is important to try to establish some general results about the 

existence of systems of scaling planes. 

A simple case concerns the regular hyperbolic fixed points x: the latter always admit a system 
of scaling planes for the forward and backward motion, i.e. for 5 and for S~^. The linearization 
of the map S around x, i.e. the matrix T = dS{x) defines the plane spanned by the spectral 
planes of the eigenvalues of T with modulus < 1 and the subspace V" spanned by the planes 
relative to the eigenvalues of T with modulus > 1. The two subspaces V^, can be continued 
into two small regular connected manifolds tangent to them in x, AJ, A", such that if y e AJ 
then \S^y — S^x\ ^^^> bounded proportionally to if L denote the absolute values of 
the eigenvalue of T closest to the unit circle and < 1. Or, similarly, if y G A!^ or > 1 then 
\S~^y — S~^x\ ^^^> bounded proportionally to l^'^, if Z+ denotes the absolute value of the 
eigenvalue of T closest to the unit circle and > 1. 

In this case the forward scaling coefficients coincide with the absolute values of the eigenvalues 
of D, called in stability theory the Lyapunov coefficients of the fixed point. They can also 
be defined as the eigenvalues of the matrix D (whose existence is not completely trivial, see 
problems): lim„^oo((T'*)"T")-'^/^" = D. The backward scaling coefficients can likewise be 
identified with the eigenvalues of the matrix lim„^oo((T^'^)*T^'^)^/^'^ = Z)_ and, therefore, 
they are the reciprocals of the forward coefficients. The contracting plane for is, also, 
spanned by the eigenplanes of Z)_ corresponding to the eigenvalues < 1. 

It is convenient to set up the following definition for future use: 

5 Definition: If x is a fixed point the set U^qS''°A!^ = will be called the "global unstable 
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manifold", while the set U^q5 '^A^ = will be called the global stable manifolds of x. 

The two manifolds are locally regular around all their points that do not fall on the singularities 

of 5*^ or S~^, respectively, for some fc > 0. But they may be even disconnected, in general. 

The subspace tangent to at x coincides with the plane generated by the vectors associated 
with the Lyapunov numbers less than 1 which, as said above, is intrinsically associated with S. 
It consists of the infinitesimal vectors that contract at exponential rate under the action of S. 

One cannot identify "similarly" the subspace tangent to A", at x, as the plane generated by 
the infinitesimal vectors expanding at exponential rate under S because, as already pointed 
out, such vectors are not unambiguously defined (given a vector expanding at exponential rate 
one can add to it a vector that expands at a lower rate, getting a vector expanding at the same 
rate). However one can identify in the same way as A* by replacing S with S~^. 

The extension of the analysis to the case of a periodic point is very easy and well known and it 
will be skipped. In the case x is not fixed, nor periodic, the discussion above leads to a natural 
extension expressed by the definition: 

6 Definition: A point is said to admit a dynamical base {Zi, . . . , Zg), consisting of s mutually 

transversal planes, if: 

(1) it is regular for S and and it admits, both in the forward and in the backward motions, 
scaling systems of planes with opposite Lyapunov exponents. 

(2) no Lyapunov exponent vanishes. 

(3) the following limit relation holds as k ^ +oo as well as k ^ — oo: 

lim y log \dS''u\ = Xj{x) for u e Zj (2.5) 

fc— >±oo k 

In this case the planes Vj = Zg (B ■ . . (B Zj will be called the system of contraction planes for 
S and the Vj = Zi (B ■ . ■ (B Zj will be the system of expanding planes for S. If s- is such that 
Xj < for j > s- and Xj > for j < s- the planes = Vs_ and V" = K.-i will be called 
the stable and unstable planes of S at x. 

Remarks: 

(a) The corresponding notions for the backward motion {i.e. for S~^) are trivially related to 
those for S. 

(b) If « e Vj{x)/Vj+i{x) then (2.4) holds. 

(c) The points S'^x also admit a dynamical base and Zj{S'^x) = dS''{x)Zj{x) for all (signed) 
integers k. 

Finally one more definition is useful to simpliiy the language: 

7 Definition: A point x is called normal if it admits a dynamical base. 

Hence a normal point is a generalization of a periodic point. And it is remarkable that 

such points do exist and in fact abund, in some sense. The first classical result concerns 
invariant ergodic distributions ^ on C: here invariance means that ^{Cf^^S'^M)) = and 
Ijl{S~^E) = ii{E) for all (Borel)* sets E and ergodicity means that there are no non trivial 
constants of motion which are not /^-almost surely constant. 

I Theorem: Let /z be an invariant ergodic distribution for {C,S): then ^ almost all points 
are normal for S. Furthermore the Lyapunov exponents and their multiplicities are almost 

* This is in parenthesis because all the sets I will mention are Borel sets or diflfer from a Borel set by a set of 
measure with respect to the measure that is being considered and which I also call for brevity Borel sets if no 
ambiguity arises. In fact, I cannot even conceive the other sets. 
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everywhere constant. 

If no contraction or expansion coefficient has value 1 and if and are the contracting and 
expanding planes, then under further "weak" regularity assumptions, there exist (integr ability 
property) manifolds W" and WJ tangent in x to and V^, smooth near x, such that: 



and the approach to takes place exponentially fast bounded above proportionally to e , where 
X> is a suitable constant (such that no Lyapunov exponent is in the interval [—A, \]). 

A discussion of the "further regularity assumptions" would lead us away from the themes 
chosen here for discussion: I just mention that they are assumptions on the speed at which a 
regular point x can approach, in its evolution, the singularities, [P]. The idea is that such speed 
should be slower than any exponential (or at least slower than the speed corresponding to a 
bound A on the contraction and expansion rate). 

The above first statement is the Oseledec theorem: a "guided" proof to it is described in 
the problems, the statements concerning the integrability property are part of Pesin's theory, 
[P]. The theorem shows that if the data are picked up randomly with respect to an ergodic 
distribution (any one!) then they provide (non constructive) examples of points with dynamical 
bases. It also shows that the scaling properties of S and those of arc intimately related 
and the two maps show the "same" properties (or, rather, properties that are trivially related), 
provided the data are randomly distributed with an invariant ergodic distribution giving zero 
measure to data visiting, in their evolutions, the singularities. "Nothing about data randomly 
chosen with an egodic invariant distribution can be learnt by running the motion backwards, 
that cannot be learnt by running it forward" . In other words one can say that the motion is 
"reversible" on such data^ 

The case in which there arc contraction or expansion exponents is more involved and it 
will not be discussed. It certainly arises when the system has non trivial smooth constants 
of motion, each of which generates a vanishing Lyapunov exponent. But such cases can be 
easily eliminated because one usually can restrict the phase space to the data for which the 
constants of motion have a fixed value. When zero Lyapunov exponents are not related to 
smooth constants of motion, however, a genuinely more complicated situation arises: I shall 
not deal with it here, besides saying that some of the above kinematical properties do not depend 
on the assum,ption of the absence of Lyapunov exponents. In particular the first sentence in 
the theorem I above does not require that the exponents be non zero. 

Also the ergodicity, in some sense, is not really necessary and it has been introduced only to 
simplify the exposition. The case in which ergodicity is missing can be often reduced to the 
ergodic case by using the "ergodic decomposition theorem": a theorem and a discussion that I 
avoid here. 

A more interesting question is what can be said when the data are picked up with a distribution 
that is not invariant. The case of non invariant distribution of the initial data will be analyzed, 
under suitable further assumptions, in the next section. Here I add only a few remarks on a 
frequently misrepresented procedure. 

Note that the Lyapunov exponents for S and those of are opposite if the data are chosen 
randomly with respect to an invariant distribution. However usually one selects the data with 
respect to a non invariant distribution: in this case the above theorem does not say much. In 
particular one should expect that the asymptotic properties of the motions in the future and in 
the past are different and described by different statistics. For instance in cases in which there 
is a time reversal symmetry one will find that the Lyapunov exponents for the motions towards 

Remember, however, that such data are very special. 



|5-fea;_5-fey| 





for y e 
for y gW^ 



(2.6) 
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the past and the future are identical (rather than opposite). This will not contradict the fact 
that if the data are chosen randomly with respect to the future statistics then the Lyapunov 
exponents for the motion towards the past would be the opposite of the ones for the motion 
towards the future! 

How comes that often a method to measure the minimum Lyapunov exponent is said to be: 
"just measure the maximum Lyapunov exponent for the backward motion and change sign"? 
In most cases this would seem wrong: for instance when the system is time reversible because 
one would, instead, get again the maximum Lyapunov exponent. 

The "paradox" is understood if one examines what is really meant by the above statement, 
i.e. what is the actual measurement performed. One finds that the measurement consists 
(schematically) in picking at random with a distribution /xq (non invariant) a point x, fol- 
lowing its trajectory for a long time T, togheter with the trajectory of a nearby point y until 
it reaches a point S^x. The rate of separation of the two points gives^ a measurement of the 
maximum Lyapunov coefficient. The trajectory of x is memorized and that of y "thrown away" . 
One then starts from S'^x and from a nearby point y', and runs the two motion backwards: 
of course the motion of S^x is already known and one computes that of y'. The motions of 
both points y, y' is computed as a perturbation of the motion of x or of S'^x (forward and 
backwards respectively) thus it is very convenient to use twice the same trajectory as this saves 
considerable time. 

But this is not merely a matter of convenience: we see that S'^x is no longer a random point 
with the initial distribution. It is, rather, approximately a random point with respect to the 
statistics JI of the distribution /ig- Therefore it will show in the backward motion Lyapunov 
exponents opposite to the ones exhibited in the forward motion (which have the same value 
if the initial data are chosen with respect to /xq or /I), by the above theorem. Hence one 
gets the wanted result: in the improper sense just discussed the maximum exponent in the 
backward motion is the opposite of the minimum in the forward motion. This property is very 
reminiscent of the build up of correlations that appears in the theory of the molecular chaos in 
the Boltzmann equation, see [CB]. 

§5 Chaotic motions. 

The definition that I shall use for a chaotic system is: 

8 Definition: A dynamical system {C,S) will be called "chaotic" if: 

(1) There exists a periodic hyperbolic point O such that the global stable and unstable manifolds 
of O consist of regular points for S^^, and are smooth, connected and dense on C, (instability 
axiom). 

(2) The restrictions of S'^ to Wq, and of S~'' to Wq, are uniformly expansive for k > and 
uniformly contractive for k <0, (expansivity axiom). 

(3) If Xn,x'^ are points on Wq and \xn — x'j^\ n^oo' then the tangent planes to Wq at such 
points become parallel uniformly at speed \xn — x'j^\^ for some /? > 0, togheter with the matrix 
that linearizes the evolution S on Wq: a similar property holds for Wq; (continuity axiom). 

(4) The manifolds Wq,Wq form, everywhere they cross, an angle uniformly bounded away from 
0, (transversality axiom) . 

(5) Given any open sets G, Fi, . . . ,r„ there is a k such that S'^G has intersection with all the 
Tj for h > I (topological mixing axiom). 

Remarks: 

(i) To understand the role of the various conditions in the above definition note that from the 
hyperbolicity of O, and from the definitions of the previous section, it follows that the tangent 



Unless the two points axe very special, e.g. if y lies on the lower dimensional contracting manifold WJ. 
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planes to the manifolds Wq and Wq are the expansive and contractive scaling manifolds of O. 
However it does not follow from this only fact that the action of S on Wq , nor that the action 
of on Wq, are expansive. The latter property, as well as any statement on the Lyapunov 
exponents on Wq, concern properties of the future evolution on Wq, while the knowledge that 
Wq is the unstable manifold of the fixed point gives us only information about the motion 
towards the past. Hence it cannot follow that ((95'^(a;)* 9S'*'(a;)„)^/^'^, where dS{x)u denotes 
the jacobian of the transformation 5 regarded as a map of Wq, has a limit as fc — > oo. The 
role of (2) is to require such properties, or at least the part of them that we need, explicitly 
(as well as the corresponding ones for Wq). Its quantitative meaning is expressed as: there 
exist Do,X > such that for all x,y £ Wq and A; > the distance du{S~''x,S~''y) between 
S~'^x,S~'^y, measured along Wq, is: 

duiS-'^x, S^'^y) < Dae-^^d^{x, y) (3.1) 
and the corresponding properties are required to hold for Wq for < 0. 

(ii) Property (3) on the jacobians means that there exists Di,^ > Q such that the "unit vector 
orthogonal to Wq at a;", which is in fact a tensor that we can denote dWQ{x) verifies: 

\dWl^{x) - dWl^{y)\ < D^d{x,yf (3.2) 

for all x,y & Wq and that there exists D2, /? > such that the jacobian dS{x)u of S" as a map 
of Wq into itself verify, for aW.x,y & W^: 

\dS{x)u {dS{y)u)-' - 1| < D2dix,yf (3.3) 
The analogous conditions are also imposed on Wq and on S^^. 

(iii) the property (4) can be put in a quantitative form as follows: if dw" and dw'^ are two surface 
elements tangent to Wq and Wq, respectively, at a common point x then the Liouville volume 
of the parallelogram generated by them is iio{dw^dw^) = b{x)dw^dw^ and e~^° < h{x) < e^" 
holds for some constant Bq. 

(iv) The continuity above is a "Holder continuity"; it is convenient here being generous on the 
weakness of this assumption and not requiring Lipshitz continuity or higher smoothness. Even 
very smooth dynamical systems may have just Holder regularity of the foliation of phase space 
into stable or unstable manifolds. 

(v) The definition essentially yields what is usually called an Anosov system. A more general 

notion (of axiom A-system) could be envisaged. However in the cases considered here the 
attractors will always be dense in the full phase space and the above generality will be sufficient. 
All the items in the definition are essential; but in a sense the axiom (1) is the major one among 
them. 

In this section I suppose the system to be chaotic in the above sense. The following analysis 
(essentially due to Sinai, Bowen and Ruellc, [S2], [Bo], [R2]), will show that there is a well 
defined statistics for the Liouville measure and the attractor will be dense on C. The statistics 
on it can be determined quite explicitly and it can be shown to have strong ergodic properties. 
The existence of a well defined statistics plays the role of the ergodic hypothesis and it will 
therefore be called ergodicity property: it will appear that it has been introduced here under 
the disguise of the chaoticity assumptions (essentially the smoothness and density of the stable 
and unstable manifolds of O). 

We first describe how the measures /i^, the past and futiuc statistics of the Liouville measure, 
can be characterized. An apparently involved geometric construction is necessary: it is very 
simple in the case in which Wq and Wq are 1 -dimensional, hence the phase space C has 
dimension d = 2. I shall, however, provide a sketch of the construction in the general case: but 
one should first understand it (via drawings) in the trivial d = 2 case, to realize later that the 
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general case is its natural generalization. The geometrical construction leads to the definition 
9 below of Markov pavement: the construction that follows is somewhat different from the 
classical constructions of [S2] , [Bol] and, admittedly sketchy (except in the d = 2 case, in which 
it becomes a well known construction whose generalization is attempted here): its details will 

not be really used in the following and one could invoke here the theorem of Sinai stating the 
existence of a Markov pavement, i.e. proceed to the paragraph preceding definition 9 below. 

Given a small ball of radius S (small compared to the curvature radii of the manifolds Wq,Wq) 
centered at x € Wq the manifold Wq will intersect the ball on a (dense) family of connected 
surfaces: only one of them will contain x and it will be called "the disk on W with center x 

and radius S" . The disks with radius 5 on are likewise defined. 

The assumed density and continuity of Wq and Wq allows us to define "disks on W^ or W^" 
centered at any point x: they are the surfaces obtained as limits of a sequence of disks of radius 
6 on Wq (or Wq) centered at the points .t„ e Wq (or Xn G Wq) of a sequence of converging 
to a; as n — > 00. And the union of the disks that match smoothly with a disk on (or W^) 
centered at x will define the manifold W^ (or W^). 

Let A, A' be two small disks on Wq and, respectively, Wq centered at O. Let i^o be so large 
that the web generated by S'^''A and ^^''''A', fills the phase space C so densely that there is 
no point further away from the web than a prefixed 6 > 0, and so that the disk of radius 6 on 
W^ centered at any x € S^°A has at least another intersection with S'^°A (and "viceversa", 
exchanging A, A' and S, S~^). 

On 5"'° A we consider all the points Xi,X2,. ■ ■ common to S~'^°A': they form a rather dense 
set of points on S'^°A, and by taking i?o large we can suppose that there is a triangulation 
T of S^"A with base on such points^ consisting of smooth triangles T^" which have vertices 
at distances < 6. By slightly deforming A we can also suppose that the boimdary d{S'^°A) 
consists of sides of some of the triangles of T.^ Let a be the smallest distance between distinct 
triangles vertices and let e be the smallest opening angle at the vertices. 

Let dg be the surface formed by drawing through each x on the boundary of each triangle a 

A smooth triangle (one should call it pyramid but I use the more reassuring word triangle to make the object look 

simpler) T is d-dimcnsional smooth manifold W consisting in a compact set with connected interior containing 
d + 1 points, the vertices, and a boundary formed by joining them with d smooth surfaces of dimension d — 1 
each, faces, containing a different d-ple of the d + I vertices and with the further property that the set is 
conical (see below) around each vertex and with the intersections of any pair of them forming a smooth d — 1— 
dimensional triangle, face. The definition is recursive if one declares that a dimensional triangle is a point 
and a 1 dimensional triangle is a smooth arc. Here conical means that at every vertex one can draw an open 
cone: this also requires a definition, as the surface may be not flat; for instance one can consider a family of 
smooth curves emerging from the vertex and tangent at the vertex to a true open right cone with apex at the 
vertex and opening angle e > 0, and with all the points close enough to the vertex entirely contained inside 
the interior of the triangle. If the boundaries of the various triangles are required to be only Holder continuous 
rather than smooth one obtains the more general notion of triangle: the key transversality property remains, 
however, unchanged for such more general triangles and we call £{T), ri{T) the minimum of the opening angles 
at the vertices of the various faces of the triangle and, respectively, the minimum Holder continuity exponents 
of the various faces. Note that all the d dimensional triangles have boundaries with rf— dimensional volume. A 
triangulation of a compact manifold with base on a given family of points is a covering by a family of triangles 
with the vertices on the base, no interior points in common and no base point in their interior. All this is 
quite trivial in the case the triangle is 1 dimensional. Consider a family of points, distributed on a compact 
manifold with smooth boundary, dense enough so that every point has another one closer than 5 to it. Then it 
is possible to build a triangulation based on the given points by trimming the manifold boundary by an amount 
not exceeding, say, twice 5 if 5 is small enough. I take this as obvious, although a proof is likely to be somewhat 
verbose. 

The deformation is at most of the order of 5e~'^'^'^ . 
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disk on VT* centered at x and of radius 5. This is a codimcnsion 1 surface. It is not difficult to 
see that if the expansion rate A of 5 is large enough (so that 5Doe~'^ <C a) then we can slightly 
modify the sides of the triangles (hence A as well) so that the image of 9| is contained in itself. 
The expansion rate will, however, be in general much weaker than needed. In that case we 
imagine, for the time being, replacing S with a high iterate of S so that the new transformation 
meets the expansivity requirement. 

The construction can be done by successive approximations, recursively, as every change of a 
given triangle boundary performed to impose the wanted condition, will ruin the validity of the 
condition for some other triangle (but by a smaller amount). One will only be able to infer that 
the final triangles have Holder continuous boundaries, even when the Wq and Wq are (locally) 
flat (it is not difficult to see that the exponent of Holder continuity should be bounded below 
by ?7 = e~(^"' ^-^ if A+ is the maximal expansion rate and A_ > A is the minimal expansion 
rate on W^, which will be in general different: just think at what happens to an angle when 
the abscissa and the ordinate are contracted by a different scaling factor). In the case of one 
dimensional A, however, the construction is clearly completed "in one stroke" and the triangles 
are just smooth segments on Wq (and their boundaries arc isolated points, hence necessarily 
smooth (0-dimensional manifolds). The reason why in general the construction leads to triangles 
with mildly regular boundaries can be understood through the celebrated elementary Bowen's 
example, [Bo2], ultimately relating it to the direction dependence of the expansion rate on 
the unstable manifold, when it has dimension > 1 and at least two distinct positive Lyapunov 
exponents (so that A-|_ > A_ as expected from the above bound on the Holder continuity 
exponent). The deformation of the triangles, during the construction, is small (at most of the 
order of ^e"^''") but the regularity of the deformation is quite out of control (because of the 
mechanism in [Bo2]). 

An identical construction can be performed on the stable manifold leading to a triangulation 
of S~'^°A', with Holder cotinuous triangles. 

If T" and are the above triangulations of Wq and Wq we can consider the pairs of triangles 
Tf^,T^ with one point in common and consider the sets Ej = x built with the points 
obtained by fixing ^ G and rj € and by drawing the disk with radii 26 on W^ centered 
at r] and on W^ centered at ^ and by considering their intersection x = x{£,, rj) as ^, ry vary in 
and T^, (here 5 may have to be required to be small enough, always with respect to the 
curvature radii of the manifolds, so that the intersection defining x is certainly non empty). 

Such sets will be called prisms: the name "parallelogram" or "rectangle" is used in the litera- 
ture: but the construction here is somewhat different from the original one of [S2],[Bo] (which 
is not based on triangulations). Furthermore even in the original construction the sets called 
parallelograms look to me, no matter how I attempt to draw them, prisms except of course in 
the two dimensional case, when they really look like parallelograms. The sets T," and will be 
called, respectively, the horizontal and the vertical axis of the prism Ej = x T^. Therefore a 
natural name for the sets d'^Ej =T^ x dT^ and d^Ej = dT^ x T§ will be horizontal boundary 
of the prism Ej and, respectively, vertical boundary of Ej (or unstable and stable boundary 
of Ej). The set i9'* = Ujd'^Ej 9" = Ujd^Ej define the total vertical or the total horizontal 
boundary of the family of prisms Ej which themseves constitute a pavement of the phase space 
with small prisms. 

The above notion of prism as a set having the form E = x , see the above two last 
papragraphs, where T" and are small connected surfaces, with Holder continuous boundaries, 
on the stable and unstable manifolds of a common point x is more general; so that one can 
consider a pavement £ of the phase space C with prisms, £ = (Ei, . . .) and define, likewise, the 
stable and unstable boimdaries d^,dj of the prisms Ej and the total horizontal and the total 
vertical boundaries 9" and 5*: note that 5**, 9" have zero volume (by the Holder continuity of 
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the boundaries). Given such concepts the following definition is useful: 

9 Definition: A pavement £ = {Ei,E2, . ■ .) by prisms is called a Markov pavement (or a 

Markov partition) if the total stable boundary is mapped into itself under the action of S and 
the total unstable boundary is mapped into itself under the action of and they have both 
volume. 

The key property of the special pavement £ constructed above, and motivating the last def- 
inition, is that by construction the total vertical boundary of the prisms is mapped into (a 
small portion of) itself under the action of 5* and likewise the total horizontal boundary 5" is 
mapped into (a small portion of) itself under the action of S^^. Hence we have just seen that 
special Markov partitions, associated with the stable and unstable manifolds of a hyperbolic 
fixed point, exist for a chaotic system at least if S is replaced by a high enough power of 
S. Such special Markov partitions are particularly suited for numerical applications, sec [FZ]. 

But it follows immediately from the definition 9 just given (and the covariance of W^,W^: 
SW^ = W^^, SW^ = W^^) that if £ is a Markov partition for then the partition Sp£ (whose 
prisms are obtained by transforming with the map S'^ those of £) is also a Markov partition 
for the iterate of S. Furthermore the partition obtained by intersecting the prisms of the 
partitions SP£ with p taking K consecutive values is a Markov partition for S. Hence any 
dynamical system which is chaotic in the sense of definition 8 admits a Markov partition £, 
[S2]. 

We can form, given any integer and a Markov partition £ a more general and finer Markov 
partition £^ simply by "intersecting" the partitions S~'^£, S~'^~^^£, . . . , S'^£. We shall only 
consider sequences of Markov partitions of the above form £^. 

Given £ we consider the map T, : x —>■ 2.{x) mapping x to the sequence aj defined by ^■'a; G E^^^ , 
the history of x on the partition £. 

This map is unambiguously defined for all x such that S'^x does not fall on the boundary of 
any one of the parellelograms E„ of £. If we define the compatibility matrix element Mcra' to 
be 1 if SE^ n ^ 0, and otherwise, where E^ is the set of interior points of E^, then it 
is clear that the history a{x) of a point x, for which it is unambiguously defined, is an allowed 
sequence: in the sense that Hjl-oo -^o-i.o-i+i = 1- Viceversa, and this follows directly from the 
niappimng properties of the bomidaries of the prisms in definition 9 (attempt a proof only after 
finding via a drawing in the trivial d = 2 case), if a is an allowed sequence there is always a 
point X producing a as history, provided one chooses conveniently the definition of a-j{x) in 
the cases in which this quantity is ambigously defined (because some iterate of x falls on the 
boundary of some E^). Item (5) of the chaoticity assumption, in definition 8, implies that the 
matrix M is "transitive" , i. e. there is an integer p such that M*' has all entries non vanishing, 
[S2]. This important property may be implied by other more topological properties, see the 
discussion of the axiom A in [Sm], but here we do not discuss this point (in the same spirit 
animating what precedes and follows, of not trying to get "optimal theorems" operating with 
"minimal assumptions"). 

If q_ is the history of a;: a = Ex, then the history of Sx is where <^ is the shift to the left 
of q_. This means: S(5'x)j = (Sx)j+i. 

Even when the history of a point is ambiguous it can be seen that the number of possible 
allowed sequences is never more than a number that can be explicitly bounded in terms of the 
maximum number of triangles with one vertex in common and the dimensions of W^, W^. But 
we do not have to worry about this, since the boundaries of the prisms of £^ have zero volume 
measure and we are interested only on properties valid for the data x which have probability 1 
with respect to the Liouville measure. 

Therefore, for our purpose, we can think that the points of C can be described by allowed 
(in the above sense) sequences of symbols a: we shall denote K the space of such sequences, 
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defining on it a metric that sets to 2^' the distance bctwcciii two sequences which agree on the 
sites —q, ■ ■ .,q but not on any larger symmetric interval. Here 2 is arbitrarily chosen and any 
number > 1 could replace it, generating the same topology on the space of sequences. In other 
words we can use the sequences a as a system of coordinates (a slight variation on the usual 

representation of the cartesian coordinates in some digital representation of the reals, e.g. base 
10; but the present representation is intrinsically tied to the dynamics). 

It is convenient to think of a history sequence a = as a configuration of a one 

dimensional spin system, to make more striking the analogy (that will emerge below) with the 
theory of one dimensional statistical mechanics : thus we call the labels j, labeling the times 
marking the history (in units of fo), with the name of sites. Hence the value of g_ at the site j 
will be (jj. 

The Liouville measure /io becomes a probability distribution /Xq on the space of the allowed 
sequences. In fact consider the set of sequences denoted 

consisting in the sequences a_ whose values aj with j = —q,...,q coincide with the given aj. 
Such set G' will be naturally given a probability equal to the /XQ-measure of the set: G = 
if E is the above history map. 

Thus setting /iQ(G") = //o(G), with G = S^^G", for any (Borel) set G' in the space K allows 
us to think of the dynamical system (C, S) as (K, (p) and the probability measure hq becomes 
Mo- 

The main point is that there is a simple formula expressing the probability i1q{G'), (Sinai). 

To find it just remark that the set G is (clearly) a prism for the Markov partition £q (because 
G = r\'^^_gS^^ E^o). If /3* denotes the surface of the vertical axis of — x T§ lying on 
Wq then the area of the vertical axis of G is essentially given by: /3*_^ Hj^-g ^siS^x), where 
x is a point in G and Ac(a;) is the absolute value of the jacobian determinant of S, at x, as a 
map from to Wg^. 

Likewise if is the surface of the horizontal axis T" of E^r on Wq then the area of the 
horizontal axis of G is, essentially: /3"^ Yl'j=o ^u^i^''^) where A„(a;) is the absolute value of the 
jacobian determinant of 5, at .x, as a map from to Wg^. 

Therefore the /xq measure of G is, essentially: 

-1 q-l 

( n ^^(^'^■)) • ( n K\s^^))pi ■ bix) (3.4) 

j=-q j=0 

where b{x)dwdw' is the volume element {i.e. the fiQ^dwdw') corresponding to two surface el- 
ements dw,dw' tangent to, respectively, W^,W^ (see remark (iv) above). The "essentially" 
means that there is an error in (3.4) due to the finite size of the sides of G: see below for its 
treatment. 

To connect (3.4) with something more familiar we can go through the definitions and re- 
marks that led Ruelle, [Rl] , [R3] , to call what follows the thermodynamic formalism for strange 
attractors. Define: 

h+ia) =logAuix{a)), h-{a) = -logAsix{a)) 

(3.5) 

hs{a) =log(3%ao), /i„(ct) = log/3„(CTo), hn{x) = -logb{x{aj} 

and remark that /iq, h- as functions of aj depend very little on the aj's with j large in the 
sense that ther exist constants Hj, k> such that for 7 = m, s, -|-, — , 0: 

\h-y{a) - h-y{a!)\ < H^e-"" (3.6) 
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if CTj = (j[ for \i\ < q. This follows again from the the continuity and hyperbolicity assumption, 
because varying q_ on the sites j > k-th means varying x{a) in 

-k . . . k 
a-k . . . (Jk ^ 

which is a prism whose diameter has size of order 0{5e^^^), see (3.1). Therefore, by the 
continuity assumption in definition 8, we see that the variation of hj, j = 0, ±, u, s, is bounded 
by 0{e~^''^S^). This is usually quoted by saying that dependence of hj on ufe "vanishes 
exponentially" or hj has "memory vanishing exponentially" . 
In terms of the functions h^, 7 = s, m, +, — , we can rewrite (3.4) as: 

This shows, unless the approximation involved in the discussion of the word "essentially" above 
spoils everything, that the probability distribution /Uq on K coincides with a Gibbs' state on K 
for the short range non translation invariant formal hamiltonian: 

— 1 00 

^ /i_(^a) + /io(o:) + ^/i_(^V) (3.8) 

j=-oo j=0 

Such a state, to the far right {i.e. in the far future) corresponds to the Gibbs state /Z^ with 
hamiltonian X^^oo h+iV'' s) the far left {i.e. in the far past) corresponds to the Gibbs 

state with hamiltonian X^'^oo ^-{f''s.)-^ 

The last statement defines unambiguously Jl'^. as probability measures on K, via the theory of 
one dimensional Gibbs states ([R5],[Bo],[Ga3]). This is so because the Gibbs states potentials 
have short range (by the above remark on the shortness of the memory, the potential decreases 
exponentially, see the paragraph following (3.5)), and no phase transitions are possible, the 
system being one dimensional. By the map S the distributions Jl'^ are transformed into the 
natural candidates for the forward and backward statistics /!_(_ for the distribution /zq. 

Hence the only problem is the check that /Iq is really the Gibbs state on the allowed seqiiences 
with formal energy (3.8): this means discussing quantitatively the error mentioned in connection 
with (3.4). All the properties of /I^ would then follow from the well known theory of the one 
dimensional short range Gibbs states for spin systems, [R5],[Bol], including the property that 
the /I± are the forward and the backward statistics of i^q. The latter property holds simply 
because the Gibbs state with hamiltonian (3.8) really converges to 'p'_^_ or to /!_ if it is observed 
to the far right or to the far left. Thus the characterization of /I^ would have received a complete 
description. 

To correct the error involved in the above use of the word "essentially" one can use the deeper 
properties of Gibbs states; namely their characterization in terms of the DLR equations. To 

use the DLR theory one needs an expression of the ratio of the probabilities that two sequences 
q} and a'^ have the value a or a' at site 0, conditioned to the two sequences having the same 
value in all the other sites. This is the ratio of the probabilities for the events in which the spin 
at site is fT or a', having fixed the same configuration for the spins at all the other sites. 
This means considering the ratio of the probabilities: 

. . -1 1 . . ,V 

lim y^-. . . a a, . . aj 

/ fl —q ■ ■ —1 1 . . q 



One should not forget that (3.8) is defined for the allowed sequences. Hence one should more precisely say that 
the hamiltonian of our spin system is (3.8) if the configuration is allowed and it is +oo otherwise: i.e. the spin 
system has a (short range) hard core coresponding to the compatibility condition. 
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where a = {<Jj)^^ is an arbitrary (allowed) sequence. Then clearly the error involved in (3.4) 
can be regarded as a multiplicative factor correction to the ratio in (3.9) approaching 1 as 
g — > 00. If a;^ and denote two allowed sequences differing only in the entry with label 0, one 
gets that the value of the limit in (3.9) is, rigorously: 



■x;:^(?»-(vV)-'>-(v^v))-(''o(£^)-?io(£"))-x;r(''+(^'2:')-''+(yV)) 



(3.10) 



which is the DLR characterization of /ij, as the Gibbs state with formal hamiltonian (3.8): 
proving, by the uniqueness theory of the short range one dimensional Gibbs states and the DLR 
theory of the Gibbs states, that indeed /ig is the Gibbs state with energy (3.8), [Bol],[Ga3]. 

Hence the forward and backward statistics Jr^ are precisely identified and described, because 
the theory of one dimensional short range Gibbs states is very well developed. It also follows 
that /U± are ergodic (in fact isomorphic to a Bernoulli shift) measures, (see [Ga2]). 

We proceed to derive some properties of /x"*" by making full use of the short range (3.6) of the 
potential generating JT^ as a Gibbs state: they will be used in the application in §4. 

Let Ft, T > {) denote the set of the allowed strings aj , | j| < T: these arc the strings that can 
becontinued to infinite allowed strings € K: we in fact imagine to continue each to some 
(arbitrarily). 

Then 77+ can be constructed as a limit of the distributions /iy defined by the average they 
assign to an arbitrary smooth function F: 



I 



^ e-S--''+^^'^"V(x(a)) 
t.T{dx)F{x) = ^ (3.11) 



Zl-T,T] 



where Z[_t^t] is the normalization factor,that should be called the "partition function" of the 
energy h+ relative to the interval [—T,T]. 

Since JI'^ is the Gibbs state with potential /i+ it follows that JI'^ is the limit of fir as T ^ oc, 
[S2]. We now investigate the probability distribution of a random variable w = J2j=-t ■fi^''^) 
with / smooth and t > 0. This is done by considering the probability distribution JIrp ^ defined, 
on the smooth functions F, by: 



/ 



,,T,tidx)F{x) = t^^^ 1^=11 (3.12) 

normalization 



We want to show: 



Theorem II: // / is smooth the probability distribution of F = Yl!^-t ^ i'^'' ^) ~ ^ computed 
by using the distribution JLj-^, T > t is different from that obtained by using Ji^ by a factor 
bounded, for each value of w, between e~^f and e^f for some Bf which is T,t independent. 

Remark: this is a simple consequence of the short range ((3.6) of h+ and the genral theory 
of Gibbs states on one dimensional lattices. Note that the set of strings in K (the space of 

the compatible strings) which agree with a given G is a prism of the Markov partition 
£t = n^^_jnS'*'f ; so that the sum in (3.12) can be regarded as a sum on the prisms of £t 

partial proof, since / is smooth the function f'{a) = f{x{a) also has exponentially vanishimng 
memory, as the h-y in (3.6): hence there exist C,^ such that f'{a) — f'{s.')\ < Ce~'"^ if 
agree on the sites between —q and q. Therefore we suppose first that /' depends only on 
the sites j G [— r, r], ("finite range assumption"). The the value of w = '^^^i f{S^ x(a_)) is 
entirely determined by , \j\ < t + r, so that the probability of w can be computed with the 
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distribution: 

Z[-T,T] 

but since /i+ has short range the above numerator is essentially the product of the partition 
functions relative to the intervals [— T, —t — 1] and [t+ 1, T], with some a;* dependent "boundary 
condition" (in the sense of the Gibbs states terminology). Therefore the ratio is bounded 
between: 

- TL. , ^[-r,-t-i] %+i,T] ^3_;l4) 

■^[-T,T] 

where B^. is a T, /-independent constant: note that the ratio of the partition functions is large 
or small (exponentially in T, T) but it is not random {i.e. it is independent of 0;*+''). 
This proves the theorem if / has finite memory r: if / has exponentially vanishing memory 
the result has to be proven by approximating it with a finite memory function and passing to 
the limit; I do not discuss this problem further. 



^4 Time reversible systems. Ruelle 's principle. 

As an application 1 consider the system in example 3, §1, to show that the analysis of §3 can 
have some far reaching consequences and relevance for a fundamental theory of non equilibrium 
ensembles. Consider a 2 dimensional periodic box containing a few circular obstacles disposed 
so that there are no coUisionlcss trajectories (this is the case called OiJ in [GG], "no horizon"). 
The box contains N particles and the equations of motion are the (1.1), i.e. with the usual 
symbols: 

ij = —Pj' Pj = -9qVe^tiq.)-dqV{q^,...,q^) + Ei-a{p)p. (4.1) 

where Vgxt is the hard core potential {i.e. it is really a boundary condition imposing elastic 

reflections on the obstacles) and V is a short range pair potential energy. The a represents 
a friction mechanism to keep the total energy H = X]jP^/2m + V{q^, . . . ,q^) bounded. We 
choose: a{p) = Ei-Y^-p./Y^-p^. which keeps H constant, exactly. This is called a "gaus- 
sian thermostat" from the meaning it has when V = Q. The case V = Q,N = 1 has been 
"completely" studied in [SI], [GELS]. 

The dynamical system generated by (4.1) can be regarded as defining a dynamics on the AN 
dimensional full phase space !F , or on the AN — 1 dimensional surface of constant energy S, or 
on the AN — 2 dimensional manifold C consisting in the phase space points of £ in which one 
particle is exactly colliding with one of the hard obstacles. We denote the dynamics on 
St that on £ and S will denote the map defined, on C, by mapping one collision to the next. 

The system properties are: 

(A) Dissipativity: the phase spaces T.,£^C contract at an average rate (27V — 1)(q!), where 

(a) is the time average of a, at least if we suppose that the forward infinite time average (a) is 
positive, as it seems intuitively correct and as some numerical experiments seem to support. 

(B) Reversibility: the map i : (g,p) {q, —p) is such that, if t ^ x{t) is a solution of (4.1), 
then t ix{—t) = {ix){t) is also a solution. 

I will furthermore suppose that the system verifies the following property: 

(C) Chaoticity: in the sense that "things go as if the system was chaotic" . 

The strict validity of this assumption in the sense of definition 8, §3, is n,ot true for the model 
(4.1) already for N = 1, simply because the system is not smooth. For large N testing its 
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approximate validity is very hard, but hopefully something will be done in the future. The 
best one can hope is that the failure to be verified is not "relevant" for the discussion of the 
thermodynamics of the system, in the same sense that the failure of ergodicity is believed to 
be, in many cases, irrelevant for the equilibrium thermodynamics {i.e. in the limit as i ^ oo 
with fixed particle density and periodically repeated obstacles). 

This is essentially Ruelle's principle: its operational meaning is to proceed as if the system was 
chaotic in the sense of the definition 8 of §3, or some similar definition, and then to suppose 
that the deductions arc correct even though we have no way to check the chaoticity assumption 
or even though it does, strictly speaking, fail (as in the case at hand, (1.1)). One can hope 
that the assumptions should become more and more "true" as N grows. They fail, as just 
mentioned, for instance ii N = 1, see [CELS]: but, in this particular case, some conclusions of 
§3 would nevertheless be correct even if N = 1, as it seems likely to be implied by combining 
the present analysis with that of [CELS]. 

For instance an example of a property that is expected to hold, if the chaoticity assumption 

holds, is a strong, exponential, decay of the correlations between smooth observables, [S2], [R2], 
[Bol] . The above assumption is therefore expected to correspond to a strong exponential decay 
of the correlations, at least over a time scale that is reachable by the experiments that can be 
conceived in order to check the picture that we are developing. 

The latter decay has not been experimentally investigated (because it is really difficult). But 
after a decade of uncertainty it is becoming increasingly acceptable that indeed the correlations, 
in the iV = 1, = case, may decay exponentially: in the paper [GG] the "simple" cases 
V = 0, N = I and several choices for Vu arc studied. No evidence for a non exponential decay 
could be detected, at least on time scales reliably attainable by numerical experiments (and, 
of course, for the simple observables considered). Essentially all the data confirm, instead, an 
exponential decay, both for the observables of the collision system (C, S) and for the continuous 
system {£,St). In this respect "things go as if the system was chaotic". In spite of the 
evidence for the failure of the assumption, we have therefore an example, in the case least 
favourable for thermodynamic interpretations (as A'' = 1), strengthening the view that "things 
may go as if the system was hyperbolic" expressed here. 

The existence of time reversal invariant periodic orbits can be deduced for many arrangements 
of the obstacles: hence we shall also suppose, for simplicity that the periodic point discussed 
in the definition of chaoticity is just a time reversal invariant periodic point. And to simplify 
further the analysis we assume that in fact O is a fixed point (it should be clear that the analysis 
that follows would not really change if O was just a periodic motion). 

Other properties can be expected to hold for our model, on the basis of analogies with similar 
models in which they have been shown to hold. I make here a list, hoping that someone will 
find their experimental check worth of some effort. They will not bee used in the following but 
they make the general picture brighter. 

(D) Density: following [LPR] we label the non negative Lyapunov exponents of St (on £) 
as A^, j = 2N — 1,2N — 2, . . . ,n+, in decreasing order, and the negative ones as A^, j = 
2N — 1, ... , n_ in increasing order, discarding from the enumeration the trivially zero exponent 
associated with the flow direction. Thus the maximum Lyapunov exponent is Amax = •^2iv_i- 
Then in various models it has been experimentally^^ found, starting with the work [LPR] 
which has been followed by other interesting cases, (e.g. [ECMl], [SEM]), that the graphs of 
X = j/{2N — 1) — » A^ seem to have a smooth limit f±{x) as iV ^ oo and x > x^. No 

In our paper there is only one experimental result that can rise, in our opinion, doubts about the deeay law. 
It is the result corresponding to the extremely long run leading to fig. 6 of [GG] ; see the discussion there. More 
recent work, [AGO] , provides much stronger further evidence on the exponential decay. 

By experimental measurement I mean here also a computer experiment: as I think it does not conceptually 
differ from the older notions of experimental measurement. 



18 



experimental result seems to be available in the case (4.1): I shall assume this property to hold 
for it. The Lyapunov exponents of S (on C) will be toXf, where to is the average time between 
any two collisions, which is the time scale associated with the map S). 

(E) Pairing: it has been noted that in various cases, starting with [Dr],[ECMl], that: 

At + XT =j- independent = - (a) j = 2N -1,... (4.2) 
Where (.) denotes (forward) time average (over oo time). 

The relation (4.2), that will be called strong pairing property, has been experimentally found 
to hold for many other models, as it emerges from the subsequent analysis in [SEM]. It is not 
yet clear whether it holds strictly or up to corrections of 0{N~-^). 

Remarks: 

(1) The pairing property appears to be related to several features of the equations of motion 
among which the reversibility. As essentially suggested in [Dr] one may think that the property 
holds, for instance, for locally hamiltonian equations (as (4.1) with a = 0) with a constraint 
imposed via a variational principle (like the Gauss' least constraint principle or some general- 
izations of it). But no proofs are available and, furthermore, the case [ECMl] and others in 
[SEM] do not fall into this cathegory, although (4.2) holds for them. 

(2) In other models considered in [SEM] a weaker property seems to hold: 

-C{a) < At + AT < (4.3) 

for some C > 0, that I will call the weak pairing property, whose essential feature is the N— 
independence of C.^"^ Neither (4.2) nor (4.3) have been tested numerically for the present 
model. I consider quite likely that the strong pairing rule holds in this case. 

(3) The pairing property and the density property imply that the Kaplan Yorke fractal di- 
mension of the attractor (also called the Lyapunov dimension), see [ER], is macroscopically 
smaller than the full dimension 4iV — 4 of the phase space, by an amount 0(A~^(a)iV): the 
proportionality to N of the loss in dimension is sometimes called "dimensional reduction" , [PH], 
[GELS]. 

Going back to the system (4.1), after the assumption (G) above, I want to discuss its conse- 
quences, with the main objective of finding some that could be tested and thus provide a test 
for Ruelle's principle for the statistics JI, and to see if any experimental consequences can be 
drawn from it (accessible by using the present day technical capabilities). 

Under the assumptions (G) the "principle" is in fact a theorem (see theorem II, §3): the reason 
it is called a principle is that, as already mentioned, the hypotheses of definition 8 are not to 

In [SEM] the Lyapunov exponents are considered in the full phase space JF. As discussed in [SEM] the rules 
(4.2) or (4.3) may fail to be obeyed in a few cases: but this seems often related to the way the exponents are 
counted and computed. For instance in some systems there may exist observables called in [SEM] "constants of 
motion" or "conservation laws" which, if initially having some special value, conserve it in the evolution, (note 
that this is a somewhat unusual notion of conservation law, as it holds only for special initial values), or have 
slow oscillations around it . For instance H or the flow direction in (4.1), or the total momentum P in the case 
Vext = (in the latter case this is conserved only if P = 0). And one is naturally led to fix the values of such 
observables or to perform the measurements in some special way {e.g. by timing them appropriately when there 
is a periodic forcing). This is done to simplify the calculations, eliminating trivially behaving coordinates: but 
the end result might be in a odd number of exponents and in a consequent apparent " failure" of the pairing 
rule. 
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be expected to hold rigorously for our model: they must be regarded to hold approximately, 
and this is the reason we say that to use them is a "principle", which we regard as correct for 
all the consequences it may have about thermodynamic quantities. 

Note first that the time reversal symmetry implies that Wq and Wq have the same dimension, 
equal to half that of the phase space, i.e. 2N — 1. 

A key idea behind all what is said in this paper, and on which intuition can be built, [Gal], 
is that of regarding A, rather than as a fractal set of ~ 4A'' dimensions, as a smooth surface of 
dimension 2N — 1 consisting in the unstable manifold Wq of some periodic point O. Note that, 
in this way, there is unification between the equilibrium [E = 0) and the non equilibrium cases 
(in both cases Wq is smooth and it has the same dimension). 

One should avoid thinking of A as a nasty fractal: following [Gal] one must think of it as an 
infinite uncut folio {i.e. Wq) confined into a finite region (the phase space C) and folded over 
and over again to fit into it, thus forming the (uncut) "book" A. Forcing simply introduces 
some (small if E is small) wrinkles on the folio accounting for the (mild) fractal nature of A. In 
particular the dimension of Wq stays 27V — 1 and does not change because of the introduction 
of the forcing (while the dimension of A changes by 0{N)). 

The unification between the equilibrium theory and the non equilibrium one is made possible 
by regarding the attractor in the phase space C as 2iV — 1 dimensional (rather than 4:N — 2 
dimensional as one would be normally tempted to do). 

The real question is whether, if E ^ 0, we can predict anything that could be experimentally 
checked as a test of Ruelle's principle. 

The work [ECM2] provides evidence in this direction. I apply here the ideas of [ECM2], [CG], 
to show their relevance to the present case (as an example of the claim made in [CG] about the 
generality of the ideas). 

We can study the fluctuations, in the stationary state Ji, of the average (2A'' — l)(a)( of the 
phase space contraction rate {2N — l)a over time stretches tro: 

1 /•tro/2 -, */2-l .roiS^x) 

{a)Ax) = — / a{S.ex)dd= V / a{S^S^x)dd (4.4) 

where tq is the average time elapsing between timing events, i.e. the average (over infinite time) 
of the time interval tq {x) between the event x and the successive Sx; the is the continuoum 
time evolution on the surface of constant energy. 

The quantity (2 A'' — l)(a)( is also called the entropy production rate on the trajectory stretch 
between S'~*/^.t and S'^^'^x: it is related to a transport coefficient (the conductivity, i.e. to the 
ratio between the particle current and the field E, see [GELS]), at least in the present model 
(because the relation between entropy production and a suitable transport coefficient might 
only hold for a special class of examples, see [CL]). 

If the quantity t is small compared to the duration of the experiment the quantity {a) ^ fluc- 
tuates around the mean value (a) (defined as the (forward) average of a over infinite time). If 
we write: 

{a)^{x) ^ {a)at{x) (4.5) 

we define a dimensionless random variable at{x) with (forward) average 1. We can divide the 
axis at into intervals Ijg, j = 0, ±1, . . ., with Ijg = [j6, [j + 1)6] for some small 5 and measure 
the probability distribution of at by counting how many times at takes a value in each Ij^ when 
measured at the phase space points into which x evolves at times multiples of tro, 

The measurement of {a)^{x) requires measuring the value of a points <S'~*/^x, . . . , S*'/'^~^x. 

The probability Tr{p)dp that at e {p,p + dp) can be computed from (3.13) with T t by using 
the Markov partition £t constructed in §3. The prisms of the partition £x arc; naturally labelled 
by the allowed strings a G Ft (see the paragraph preceding (3.11)); for brevity we label them 
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with a label denoted j. If Ej is a generic prism of this partition with axes intersecting at Xj, 
consider the measure /Ut,* defined by setting, for every smooth F: 

I F{x) iJLT,t{dx) = — b=i (4.6) 

where A„_((a::) = Y]^-2~t/2 ^A'^^ '■'') is defined before (3.4) and Xj is a point in Ej. In the limit 
T — »■ (X) the distribution // gives the correct probability to the values of an observable F which 
is smooth and has a t dependence like the one in (4.4), {i.e. like the one considered in theorem 
II, with f(x) = Tq^ /o"'"^'' a{S0x)d'd) up to an error which is a factor bounded above and below 
uniformly in t, by theorem II. 

The partition £ constructed in §3 turns out to be time reversal invariant, if the triangulation 
on VF' is taken to be the i image of the triangulation on W^: this means that if Ej G £ then 
iEj = Eji e £ for some f. Note that if we used another Markov partition £', constructed 
for instance via the classical proofs of existence of Markov partitions of [S2],[Bol], then the 
time reversal symmetry of our system would still imply that i£' is also a Markov partition (as 
= iWf^) so that the partition obtained by intersecting the two would still be a Markov 
partition, with the extra property of being time reversal invariant; hence by using £' fl i£' 
instead of £ we could still carry the argument that follows which only depends on the time 
reversal invariance of the Markov partition £ used for the construction of and not on the 
particular Markov partition used. Note that if £ = i£ then also the partition £t in theorem II, 
£t = n^^_-pS^£, has the property of time reversal invariance £t = i£T- 

Thus, for instance, up to the mentioned error of a factor (T, t)-independent: 



Trip) T,j,at{xj)=pK,t{^3) 



(4.7) 



where the sums run over the labels j of the prisms Ej e £t (with axes intersecting at xj) 
verifying at{xj) G {p,p + 6). 

By using the time reversal symmetry the above sums can be seen to consist of sums with the 
same number of addends. This can be done by pairing the contribution from Ej G £t with 
that of iEj = Ej' that can be easily seen to give, if T is large (so that the prisms are really 
small), a value —p to at{xj') if at{xj) = p and at the same timcAu.t{xj') = t i^j)- Hence 
the addends can be paired so that the ratios of corresponding addends is: 

^n,ti^j) ^ (2N-l)(a)ptT„ KS*''^Xj) 

where Ag^tix) is defined as Au,t{x) = n*{f-t/2To ^ui'^''^)^ '^^^'^ ^c{x) being the absolute value of 
the determinant of the jacobian matrix of S as a map of into Wg^. Eq. (4.8) simply follows 
from the fact that a volume element around S^^'/'^x varies under the action of S'* by a factor 



For instance, [ECM2], from tfie two identities 5" '^{S'^x) = x and 5" '^{iS ^x) = ix we deduce: 

dS^{S-^x) dS-^(x) = 1 = dS-^{S^x) dS^{x) 
dS-^ (iS-^x) i dS-^ix) = i 

We see that, by applying the first relation with x replaced by x' = iS~'^x and by applying successively the third 
(with X —* x') and the second relations (with x — » S~'^x, using also x = iS~'^x') in (*) one deduces: 

dS'^{S-^x')idS'^{S-^x)i=l (**) 

Therefore; ( det 95^(S-^5')) = det dS'^ (S'^ x) = e-(2JV-i)rTo(a)a^(x)^ ^^ere to is the value of the timing 
interval in real units (for us T is an integer); i.e. , replacing x with S~'^/^x: Or (a;) = — ar(a;')- 
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Kuj,{x)Kg^t{x)b{S*'/'^x) /b{S~*'/'^x) where b{x) describes the transversahty of the intersection of 
the stable and unstable manifolds at x, as defined in remark (iii) to definition 8: so that this 
equals exp— (27V — l)t To{a) ^{x) , (here tq is considered extremely small, for simplicity). 

As repeatedly mentioned above (4. 6), (4. 7) require a correction: but it has been remarked after 
(4.6) that the correction is a factor bounded by for some B which is p-indcpcndcnt because 
of theorem II; and we can include in B also the correction due to the last ratio in (4.8) (bounded 
uniformly in t, by transversahty property in the chaoticity assumption, see definition 8 in §3). 

This means that 7r(p) verifies {N large): 

1 , -k{p) , , . , 

— log — -. > (a) = p - mdependent 

2NtT0P Tr{-p) W^oo.t^oo \ / y f ^^g-j 

= ^-i(p)+p{a)Nt±B 

where £,{p) is an even function of p (over which we have no control) and B docs not depend on 
p nor on t (but it may depend on N, E). The (4.9) can be considered as a large deviation result 
(both in N and t): its peculiarity is the p independence of the coefficient of p in the odd part 
othe argument of the exponential, to leading order in t. 

The p,t,N independence of the first relation in (4.9), and the equality to (a) can be quite 
easily tested even if N is moderately large (but not too large: as in such case the measurements 
simply could not be carried out): A'' = 64 seems feasible. 

The idea of the above test, and its realization in a model different from (4.1), is in the basic 
paper [ECM2] and it has been further developed in [CG]. The test does not require the mea- 
surement of the individual Lyapunov exponents: therefore it should be (relatively) easy to carry 
out. Particularly after its feasibility has been proved in similar models in [ECM2]. 

Finally it is remarkable that one can, at all, find tests of the principle: in the case E ^ there 
is no really well established non equilibrium thermodynamics with which one could compare the 
results of the principle. Such general results would constitute the real test of the principle from 
the point of view of Physics. A non controversial theory of non equilibrium thermodynamics 
could, actually, follow from the principlc\ For this, however, we must learn how to extract other 
consequences, just as we learnt to extract consequences from the Boltzmann Gibbs principle. 
And it is important to stress that whatever predictions it gives they should be as true as the 
statements derived in equilibrium are: i.e. essentially exactly true. Therefore the check above 
(and others that might be devised) will be satisfactory only if it gives exactly the expected 
results, within the experimental errors. This seems to be the case in [ECM2] for the model 
considered there, see also [CG]. 

§5 Summary and outlook 

Here I regarded the Ruelle's principle as valid: to clarify its status I considered of some interest 
setting up a list of assumptions that would imply it rigorously. This essentially amounted to 

saying that the system behaves as an Anosov system (or more generally I could have supposed 
something like an axiom-A behaviour) and then use the theory of Sinai (or, respectively, of 
Bowen and Ruelle) to "prove" the principle, see §3. The interest of course is not in the math- 
ematical theorem as I just made enough assumptions to make it valid and checked its validity 
by adapting the ideas of the key papers [S2],[Bol],[R2], (while attempting, to avoid repetitions, 
at describing a more intuitive construction of Markov partitions), but rather the interest lies 
in the clarification of the meaning of the property (C) in §3. One cannot be too demanding on 
the matter of mathematical rigour: it should not be forgotten that even the ergodic hypothesis 
of Boltzmann is far from being proved, particularly in the generality one would want. 
The "application" deviced in §4 is not very satisfactory as a test of validity of the principle 
becaiisc it can be pc^rfornicd only if N, r arc not too large. It would be, of course, nice to find 
a true thermodynamic property that could be computed and tested via the principle. 
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1) One should remark that the above analysis should improve with N, as the non chaotic phe- 
nomena should become less important: they are excluded by the strict chaoticity assumption, 
see §2, but they are possibly present if the assumption is made in the loose form of §3, (C). 
They would be certainly present if the hard core potential in (4.1) was replaced by a smooth 
finite range potential steeply diverging at zero distance (by general results from KAM theory). 

2) Predictions analogous to the ones at the end of §4 could be made for other models, see [CG] : 
but except in the cases in [ECM2] the numerical experiments do not seem to exist, yet. 

3) The new principle appears to play the role that the crgodic hypothesis plays in equilibrium 
statistical mechanics: therefore one may be led to think that the time of approach to equilibrium 
should be of the order of the recurrence time on the attractor. Having set up a unified point of 
view for the equilibrium and the non equilibrium cases, we can adopt the classical explanation 
of Boltzmann, [B96] , [B02] , apparently relying also on an earlier suggestion by Thomson, [T] , 
intended to contradict such a hasty conclusion. The rate of approach to equilibrium is very 
short (essentially determined by the Boltzmann equation in the case of rarefied gases, as ex- 
emplified, for instance, by the Lorentz theory of conductivity in metals, [Be], closely related to 
model 1) at least if one looks at the very few macroscopic observables relevant for equilibrium 
thermodynamics and for the transport coefiicients: the reason being that such observables have 
the same value on most of the surface Wq, see [Gal]. 

6) Concerning the particularity of the gaussian thermostat, which could be called an "unphysical 
fiction", I think, see [Gal], that there should be, also in non equilibrium, several equivalent 
ways of describing the same stationary distribution corresponding to different Jl and to different 
physical ways of reaching the stationary state. And it might well be that the gaussian thermostat 
turned out to be equivalent to other models of thermostats, which could be described by rather 
different attractors. For instance a stochastic thermostat, in which a particle colliding with the 
wall comes out with a maxwellian distribution at given temperature, will certainly be described 
by a statistics JI which is absolutely continuous with respect to the Liouville measure. In 
the thermodynamic limit this might just be the same as the result obtained with a statistics 
which, for finite N, is on a strange attractor. This mechanism is like the one realized by the 
microcanonical and the canonical ensembles (the first is concentrated on a set of configurations 
which has zero probability with respect to the second, as long as TV < oo). This is clearly a 
question that requires further investigations. 
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Note that a stochastic model of thermostat is described by a stochastic differential equation and therefore our 
discussion does not apply without some major modification. 
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Appendix: Philosophical questions; and a few concrete ones 



The following is a guided series of problems to the general theory of the Lyapunov exponents: it 
contains the ergodic commutative, subadditive and non commutatitive theorems and the Oseledec 
theorem. The theorems are mostly philosophical, i.e. they hold with essentially no assumptions. 

The problems are taken out of the preprint of the book Meccanica dei Fluidi, in italian, circu- 
lating in the form of a draft,: the Oseledec theorem exposition is essentially taken from [R4]- 

(1) If /I is a probability measure on the Borel sets of R" and if A„ is a sequence of measurable sets such that 
/j(A,i) < +00 then almost all points are contained in at most a finite number of sets in the sequence (Borel 
Cantelli theorem). (Idea: the set of the points in an infinite number of A„'s is N = n^j(Uj^j,A/i), of course. 
Therefore IJ,{N) < ^^j. A»(A^) for all fe's, hence n{N) = because the series converges.) 

(2) Let (C, S, /i) be a triple formed by an invertible dynamical system and an invariant probability measure 
on (the Borel sets of) C {i.e. there is a zero measure set N such that S is invertible outside N and l-i{E) = 
IJ.{SE) = fj,{S~^E) for all Borel sets E C C/N). Let / be a (measurable) function bounded by a constant K 

almost everywhere (with respect to ^). Let Dn be the set of points x such that some average of / over a time 
< n is non negative: i.e. X/J— /C"^^^) — ^ ^'^^ some rn < n. Then f{x)ti(dx) > 0. (Garsia maximal 

averages theorem). (Idea: if n = 1 the condition defining Di is simply f(x) > and nothing has to be proved, 
besides the obvious. If n = 2 the condition defining D2 is either f(x) > or f(x) + f{Sx) > 0. Therefore the new 
points, i.e. those in D2 but not in Di, are points x where f{x) < which can be paired with point 5a; G Di so 
that f{x) + f{Sx) > 0. Hence we can subdivide D2 in the disjoint union of D2/D1 U 8(02/ Di) and of D1/SD2. 
On the last set it is f{x) > while the integral over the first union can be written as / (fix) + f(Sx))iJ,{dx) 

because, by the invariance of fi the integral J^^^ j f{^)t^i<ix) = f(Sx)^{dx). The case n = 3 is only 

slightly more involved and it is left to the reader (proceed in the "same way"), and, once understood, the general 
case becomes crystal clear.) 

(3) The invertibility assumption is not necessary in (2): the invariance in the ordinary sense /i(E) = ix{S~^E) 
for all Borel sets E is sufficient. Prove this statement. (Idea: one has just to try to formulate what said by 
always using S^^: for instance D2 will consist of the points in ZPi and the inverse images of those in ZPi such 
that f{x) + f {S-^x) > 0, etc.) 

(4) Show that (2) implies the almost everywhere existence of the limit limj.^00 i 1 /('^"'^)- (Wea: let 
fsup{x) = lim 

^^Pfc^oo ^^j— /C*^^-^) and finfi^) — liminf/^^oc ^ X-^j— .^('^^■^)* The denial of the statement 
is that fsup{x) > a > 13 > f infix) on a set D which has non zero measure for a suitable pair a > /3. Quite 
absurd because on the (obviously) invariant set D the functions }{x) — a and /3 — f(x) would have some non 
negative average. Hence by the previous theorem their integrals over D would have to be > 0, but their sum 
would therefore be (ft — a)fi(D) which would be < 0, because we are (foolishly) thinking that /i(-D) > 0!) 

(5) The boundedness assunption in (4) can be replaced by the summability assumption / 6 Li (/x). Furthermore 
if f{x) denotes the average of /, defined /x almost everywhere, show that: 



\f\L, = / M(da:)|/(x)| < / f.^{dx)\f{x)\ = \f\L, 
Jc Jc 

J l{x)l^{dx) = J f{x)ii{dx) 



If there are no non trivial functions f{x) which are constant of motion fx almost everywhere, i.e. if the system 
is ergodic, then the function fix) is a constant (almost everywhere) and f = l^idy)fiy) for all / € Lii^). 

(6) Let (C, S, /i) be a dynamical system as in problem (4). And let fnix) be a sequence of measurable functions 
such that: 

|/i(x)| < K, fn+mix) < fnix) + fmiS"x) /i almost everywhere 

Suppose fi ergodic; by applying the ergodic theorem (."?), (4) above show the existence, fi almost everywhere, of 
the limit limn— ►oo ^/"(^) = /(^): {Kingman's subadditive ergodic theorem). (Idea: Remark that the functions: 

/sup(a;) = limsup —fnix), finfix) = liminf —fnix) 

n^oo ri n—*oo Tl 

verify fsupiSx) > fsupix) and finfiSx) > fi„fix) (because fnix) < fiix) + /n-i(5x), then divide by n), 
so that the /i invariance of fx implies that they are constants of motion (because JifsupiSx) — /(a;))d/i = 0). 
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Therefore, p. almost everywhere, fsup(x) = /3 and finfi^) = ce where a < /3 are suitable constants. Let us 
consider a number rj > such that a + rj < 13. 

Let An be the set of points where fn{'J^) < Q + »? for at least one value of m < n: hence lim„^oo tJ-(An) = 1. 
Given e > there is, therefore, a such that ^(A^^ ) < e if denotes the complementation operation on sets. 
If X C one can suppose that the frequency of visit of x to A^^ is < e, because such frequency is the average 
value of XA^ i^'' ^) (over j) if XA denotes the characteristic function of the set A (by the ergodicity). 

Consider the sequence of times ji < j2 < ■ ■ ■ when, instead, 5-"= a; S AJ^^: it follows that the number p of such 
j's with ji^ <T is such that p/T < e for T large enough (as p/T tends to the frequency of visit to A^^). 

Let fco be the first time < T in which S^" x S A„^ and let fcg be the largest integer < T such that fy _ j.^ (S'°° x) < 

(fey — kQ)(a + rj). The point fe,', + 1 must be one of the j's, otherwise it could not be the largest integer k' such 

that f.t^f.^ fk'-koi^'"''^) < o + ?7 by the subadditivity of /, unless of course fcg = T. 

Let fci > k'fj be the first value, with ki < T, not among the j's, i.e. such that S'^ia; G A„j and let k[ > fci be 

the largest value such that (S^'^x) < (fc^ — ki)(a + r}), and so on. 

In this way a sequence [fco, fc^], • ■ ■ , [fcs, fc's] is constructed in the interval [0, T] . All the values fc < fc^ outside the 
intervals must be among the j's (hence their number is < p). The last value k'^ will be, possibly, followed by a 
string of values among the j's but the first value fc^+i that does not have this property, if existing at all, must 
be within of the value T, otherwise we could form the interval [fcs+i, k'^^-^]. Therefore the number of values 
outside the intervals is bounded above by p + n^. The subadditivity then implies; 

s 
i=l 

which shows the one cannot contemplate the case a < /3, by the contradiction it does provoke: hence a = jB.) 

(7) Show that (6) implies that lim„^oo ^{x) = fix) is a /U-almost everywhere constant, and: 

lim -/„(a;)s/= lim - j fn{x)iJ,{dx) = 'va.i— I fn{x)ti{dx) 

n— »oo n n — ►oo n J ^ n Tl J ^ 

(Idea: subaddititivity implies }n{x) < fi{x) + fn-i{Sx}; hence f{x) < f{Sx) and < J^{f(Sx) — f{x))djji = 
imply f{Sx) = f{x) /x-almost everywhere. Then ergodicity yields that / is constant. By dominated convergence 
the first limit relation follows. The sequence n (fn) = fn(x)n{dx) is subadditive and bounded by K; hence 
^(/n> n^oc' infn ^(/n), by an elementary argument.) 

(8) Show that the assumption |/i(a;)| < A" in (6) can be replaced by the summability of /+, f~^{x) € Li{ij,), 
if f~^{x) = max(0, /i(a;)). (Idea: no idea is necessary; just a careful examination of the proofs in (6), (7).) 

(9) Show that the ergodic theorem implies that if N has zero measure and N is measurable then fi(SN) = 0. 
(Idea the frequency of visit of the motion starting at x to the set SN, <px{SN), is equal to that to N itself: 
<pa:{N) = (px{SN); hence by the ergodic theorem n{SN) = J (px{N)dfj. = J (px{SN)dn = iJ.{N) = 0.) 

(10) Define, via the ergodic theorem, the "future" and "past" averages of / 6 ii(/i) as the limits f^{x) = 
lim ^ /(S^^a;). Show that /+(a;) = /~(i;) /i-almost everywhere. (Idea: Let D, a, /3 be such that ;i(£)) > 
and f+{x) > > a > f~{x) for x & D. Let D+ be the set of points x & D such that Yl'^jZl fi^^'-'^) = 
(/)m(^) > fo"" ra> n. Let be the correspoonding set where {f}^{x) < a. Then, if n is large enough 
^-(n-i)£)- ("!£)+ 0^ because the quantities n{Dn) = /i(5~("~^)l3^) and n{Dn) are both very close to ^l{D) 
for n large. Hence for x in the latter intersection it is a > (/)^(a;) = (/) J(S~'"~^^a;) > /? by the properties 
defining £),D^,D^. This is impossible.) 

The guided sequence of problems will now come "closer to Earth" by studying some finite 

matrix theorems. 

(11) Let L be a real d x d non singular matrix. Consider the matrix M„ = ((I/*)"-L")^/^ and call t^."^ 
the eigenvalues of M„ ordered by decreasing size. Show that the largest eigenvalue of M„ is such that the 
limit i log t^"^ n^oo' -^1 exists. Here * means transposition with respect to the scalar product (u, v) = 
^^iiiDi. (Idea: first show that the eigenvalues t^"' of ((L-^)"L")-'-/'^" , ordered according to decreasing 
size, do have a limit as n oo. Note that fn = log|L" | verifies subadditivity fn+m fn + fm and 
fn < "-loglLI, if \L\ is the norm of the matrix L (i.e. \L\ = max |Li;|/|ii|, with \v\ = {v,v)^^'^). Hence the 
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limit n ^ log |L"| n^oo' ti = inf„ i log \ L" \ exists, and ^fn> min log | if fij arc the eigenvalues of L. But 
|L"?;| = (L"-v, L"-!;)i/2 = (^^ (L*)"L"i;)i/2 < maxi<j<d(tj"')i/n, if |^| = 1. Hence the largest eigenvalues have 
the appropriate convergence property.) 

(12) Think of the vectors in R!^ as functions i —* Ui on the finite space F s (1, 2, . . . , d). Let (ii'*)^« be the 
space of the functions on F'' which are antisymmetric: these are the functions Mi— which are antisymmetric. 
Define a scalar product on such functions by setting {U,v) = ^ Uij^,,,i^Vi-^,,,i^ . Define the matrix L^'? by setting; 



Show that the result of (12) implies that the matrix (((L^9)*)"(L^'')")-'-/^", where the * denotes the adjunction 
operation with respect to the scalar product («,f), is such that its largest eigenvalue logarithm divided by n 
has a limit as n — > oo. 

(13) Let (t^"')^ be the eigenvalues of (L*)"L" in decreasing order, repeated according to multiplicity. Prove 
that the eigenvalues of ((L^'')")* (L^«)" are just the products of the g-ples of eigenvalues (t^"')^ • • • (4"^)^ '^^t'l 
ii 7^ ij (i-e-. products of g-ples of pairwise distinct eigenvalues). (Idea: no idea is necessary.) 

(14) Combine (12) and (13) to infer that the limits logtj"^ = Aj exist for all j = 1, . . . ,d. Let Ai, . . . , Aa be 
the distinct limits Xj and let mi, . . . ,ms be their multiplicities {i.e. the number of Aj which are equal to Xj). 
Verify that "^i = <^ ^''^'^ define r(i) = j if Aj = Xj. 

(15) Let be the linear space spanned by the first mi eigenvectors of A„ = ((L* )"L")^''^ ; likewise U^"^ 

will be the space spanned by the next m2 , and so on until U^^^ is defined. Show that the notion of multiplicity 
introduced in (14) is even more justified by proving the following "orthogonality" property between unit vectors 

u e C/^"' and u' e u''^'^''\ k>0. Given 5 > there exists C > such that: 



\(u,u')\ <Ce-(l^'-VI- 



5)n 



(Idea: nothing to prove if r = r' , of course. The "easy case" is r' > r. Let A„+i = t')"'^''^ Pi be a spectral 

decompositon for the matrix A„_|_i. Then supposing n so large that for m > n it is |^ logt^™'^ — Ai| < <5i for 
all r{j) = i and all i, where for a given Si > 0, Si < ^ min(|Ar. — A^/ 1): 



\iu,u')\ se-("+i)V max^,^^(„+i) |(«,E.(i)=.' e("+^^Vp,„') = 
=e-("+l)V max^,^^(„+i, |(E.(,)..' e("+l) V P,„, „') < 
<g-(n+l) V \y , e("+i) V Piu\ < 

— I Z_-/r(i) — r' * I — 

<e-("+l)V+(«+l)«i \y , t("+l)p,n| < 

— I Z-^r{i) — r' « I — 

<e-("+i)V + ("+l)'5i |((L*)"+i L"+i)i/2n| 
=e"("+i)V+("+l)'5i(M, (L"+i)* L"+ln)l/2 

Thus, again by the spectral theorem and by \TT'\ < \T\ \T'\ we deduce: 

\{u,u')\ <e-("+l)(V-«i)|L"+l«| < 

<e-(n+l)(V-5l)|i||j;,n„| < g-(n+l){ V - A. -25i ) |^| 

because u £ U^"\ This completes the proof in the case r' > r and fc = 1. The case A; > 1 is simply obtained 
from the case A; = 1 by applying the latter inequality k times. One finds, since the series converges, the result 
with a C that can be taken Ci = l-I^Kl — e~^)~^ if < a; < min(|/i,. — /i^/ 1) — 2Si for all n large enough and all 
A; > 0. 

Consider the case r > r' . Let Ua be an orthonormal base with the first mi vectors spanning C/-^"^ , the next m2 

spanning £/^"' and so on; let be the corresponding base for n + k. The orthogonal matrix ^V^^^f = {ua^u\) 
verifies the inequalities: 

'\Wa,a'\<^ for all a, a' 

|W„,„/| < Cie"''^'-(°)^'-(°')-2'5il" if r(a') > r{a) 
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If r(a') < r(o), however, the orthogonality implies that W^a' = {W^^)a' a- The latter quantity is "just" the 
determinant obtained by deleting the row a and the column a' from the matrix W. Sueh determinant consists 
in a sum of (d — 1)! products of d — 1 matrix elements Wij picked up in pairwise distinct rows and columns (by 
Cramers' rule). Since we are interested in the non diagonal elements of W^a' see that in each product there 
must be at least enough factors Wfjfji with r(/3') > r(/3) so that '^tJ-r(f)') ^ Mr{/3) ^ Mr{a) f^r(a') (hint: cheek 
this first when there are no degenracies, i.e. r(a) = a). Hence we use for such factors the second inequality and 
just bound the others by 1. We finally get the result with C = Ci{d — 1)! if 25i(<i — 1) < <5.) 

(16) Show that (15) implies that the planes Uj"^ C il'^ have a limit Uj as n — » oo and the planes Uj are 

pairwise orthogonal. (Idea the planes U^"'^^^ and f/^"^*'' must form with the planes ^^^y^ U^""' an angle 
closer to 90° by an amonut prefixed arbitrarily if n is large enough. Hence they form a Cauchy sequence of 
planes converging to some C/i. The other planes are treated analogously.) 

(17) Check that the previous problems (11)h-(16) imply that if L is a real d X d non singular matrix the limit 
lim„^oo((i*)"i")^/^" = D exists and it is a non singular positive definite matrix, with eigenvalues equal to the 
absolute values of those of L, counted according to multiplicity. The eigenspax;es C/j"* of Dn S ((L*)"L")l/2n 
spanned by the r{j) eigenvalues of D„ that converge to the j-th distinct limit value are planes that converge to 
limit planes Uj which are the eigenplanes of D which correspond to distinct eigenvalues. (Idea the only thing still 
to check is the relation between the eigenvalues of D and those of L. Suppose for simplicity that the eigenvalues 
of L have pairwise distinct absolute values (hence they must be all real) and order them by decreasing absolute 
values m, . . . , fi^. Let L = ^ . fJ.jVj X v* be the spectral resolution of L where Lvj = fJ-jVj, L*v* = /ijf * and 

{v*,v) = 1. Clearly \L"vj\ = \nj\"\vj\. Hence = Ai. The other equalities can be deduced for instance by 
the "trick" of considering the matrices L^', as in (13).) 

Going back to the realm of abstract thinking the following guided problems, combining the 
concrete theory (11)^(17) and the philosophical results (1)^(10), lead to the Oseledec theorem. 

(18) Given an ergodic dynamical system (C, 5, fj) with an invariant distribution /i such that there is a zero 
IX measure invariant set N outside which the transformation S is invertible and non singular, consider the 
matrix 95"(a;) = T^(x}. Suppose that |detT(a;)| > e > and that \T{x)\ < E in C/N. Check that T„{x) = 
T{S"~^x) ■ . . . ■ T(Sx) ■ T(x), and that fn{x) = log |T„(a;)| verifies the subadditivity property of the subadditive 
ergodic theorem of (6). Therefore: lim„_>oo ^ log |T„(a;)| = Ai(a;) exists /i almost everywhere. 

((19) Define the matrices T^'^(x) on i?/^'' as in (12), (13) above, and by repeating the argument there (with 
(18) replacing (11)) prove that the limits limn->oo ^ log t^."' (a;) = Xj{x) exist almost everywhere for almost 
all X. 

(20) Show that the analysis in (15) can be repeated word by word even when the matrices are re- 
placed by T"-{x), in the points where the limits in (19) exist, i.e. almost everywhere. The matrix D = 
\im^^^{(T^(x)Tf^{x))^^'^^ has eigenplanes U\(x), . . . , Us(^^\{x). Show that the spaces Vj{x) = Uj{x)®. . .ffit^s{i:) 
can be identified with the system of scaling planes for 5 of definition 6, §2. 

(21) The contraction exponents \i{x) are defined almost everywhere and they are constants of motion togheter 
with their multiplicities mj{x). Therefore they are n almost everywhere constants. (Idea: By (20) and an 
argument like that in (17).) 

(22) Check that all the above results can be derived by just requiring that T(x) is such that log"*" |T(a;)|, where 
log"*" denotes the positive part of the logarithm function, is /U-summable. (Idea: just careful examination of the 

proofs.) 

(23) If T{x) is replaced by any d' x d' -matrix valued function x — > 0{x) with log"*" |0(a;)| /x-summable the 
"same results" can be proved. For instance if 0„(a;) = OiS^-'^x) ■ 0{x) then D„ = {0*{x)0„{x))'^/^" has 
a limit D as n — > oo which is almost surely independent of x, and the eigenspaces, spanned by the eigenvalues 
of Dn whose eigenvalues converge to the same limit Xj, converge to the eigenspace of D with eigenvalue Xj. 
In other words the above theory extends trivially to a theory of products of random matrices 0{x) generated 
by selecting randomly by a distribution /i on C a point x and the corresponding random matrix 0{x) and by 
multipliyng 0(5"-i(a;) . . . 0{x). 

(24) Let f{x) = dS-'^{x), d3{x) = T(x). Then f{x) = T{S-'^x)-'^ and Tfe(5*=a:) = (95-*= (5*2:) can be written 
either T{Sx) . . . T{S^x) or also as T~^{x) . . . T~^(S''~^x). Show that if x admits a system of contracting planes, 
i.e. /U-almost everywhere, the eigenvalues i^^^ (ordered by decreasing size) of (T^(5'°a;)T)j(5'^a;))^/^ = have 
the limit: lim i logt^.*"' = Xj = —X^_j, /i-almost everywhere. (Idea: note the following expression for Tj.: 
f*{S''x) = f*{S''x) ...f(Sx) so that the mati ices are nmltiplied in the correct order for the application of 
(23): hence (Tj, (S*=x)T * (S'=x))i/2'= (identical to {T*{x)Ty,(x))-^) converges to a matrix D = D^^, /i-almost 
surely (or if x admits a contracting system of planes). Thus the limits exist, because the spectrum of T*T 
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and that of TT* coincide in general, if T is non singular; their relation with the oppositcs of the contraction 
exponents \j in the forward direction is derived from Tf^{S^ x)T^{S^ x) = (T^ (Sx)Tk(Sx))^^ .) 

(25) In the context of the problems (20), (24) consider the system of planes Vj{x), at x, defined by the 
eigenplanes Ujix) of the matrix D of problem (20) by Vj = C/s_j+i ® ... ® Ui{x), and existing /i-almost 

everywhere. Show that if G Uj{S^x)/Uj+i{S^x) and Wf^ = dS^{x)u (hence u G (jj(x)/Uj-^-i(x)) then 
limfe^oo i log \T{S'')wk\/\wis\ = Why one should not call Vjix) an expanding plane when j is such that 

^s-j' < fo'' j' < 3 ^■'id positive otherwise? 

(26) Let us call the spectrum of a sequence of matrices On the d numbers u>\, . . . ,u)d obtained by considering the 
logarithms of the eigenvalues of (O^Oj.)'^/^*', o^*^' > . . ■o^^\ and setting, when they exist: limj.^;,^ oj*^ = ujj. 

Show that the spectrum of the sequence Ok{x) = Ti^(S~^x) coincides with that of Tfe(x) ^-almost surely. 
(Idea: if the functions n — > fn(x) and n fn{S"x) are subadditive, in the sense of problem (6), the functions 
n — > fniS^x) and n fn{x) are such that F = linin^oo — fniS'^x) and G = limn^oo —fnix) exist and are 
constant /i— almost everywhere, by (6) above. Suppose that F > G. Consider the sets D„ and Dj^ consisting in 
the points where fm{S"^ x) > F — e > G for all m > n, or respectively where -^fmix) < F — e lor all m > n. 
Then n{Dn) , fi{D'^) -+ 1. Hence for n large enough S-^D'^ n £>„ 7^ 0: if a; is in this set then j^f{S"x) < F-e 
because 5"a; € D'^ and ^fn{S"x) > F — e because x 6 D„. Hence this contradiction shows that it is sufficient 
to check that the two functions are subadditive: this comes from the inequality \AB\ < |yl||_B|.) 

(27) The spectrum, see (26), of T_fe(x) = f{S-'^''-^'> x) . ..f{x) is the opposite of that of Tfc(x-): = -\d~j+l- 
(Idea: T_fe(a;) = {T^{S-''x))-'^; hence T_fe(a;)*f_fc(a;) = {{Tk{S-''x){Tk{S-''x))*)-'^; but the spectrum of the 
last matrix is the same as that of the matrix (Ti^{S~'°x)*T^.{S~''x))~^ which by (26) leads to the result.) 

(28) The forward and backward systems of scaling planes exist /i-almost everywhere and have opposite corre- 
sponding exponents: Xj = — ai^-j+i. (Idea: this is a corollary of what already discussed in the few preceding 
problems.) 

(29) Show that if V-j(x) are the contracting planes for S~^, defined /i-almost everywhere, then Vr{x) n 
l^_(s_,.+2) (x) = and Vr{x)(BV_^g_r^2) = R'^t /i-almost everywhere. (Idea: since, by definition, the dimensions 
of Vr and of y_(j,_^_|_2) are complementary to d one only has to check the first statement. Note that if ti S Vr{x) 
it is |Tfc(x)'!i| < e*(-^''+'') |tt| for any prefixed 5 > and for k large enough and all x £ Dj. where Dj. is a 

set such that fi(Di^) — > 1. For the same reason \T_^{S''x)v\ < e-(^'-i+'')'=|D| for v e V-r{S''x) provided 
S'^x £ Df,. Since T_f,{S^x)V_(^g_j.j^2){S^^) = ^-(s-r+2)(3;) we see that all the u G V_(g_j.j^2){^) have the 
form u = T_k{S''x)v with v € V-r{S''x). So that using Tk(x)T_k[S''x) = 1 we get: 

\T.k{S'=x)v\ < e-^^--^-'^''\mx)T_k{S''x)v\ ^ 
^ \Tkix)u\ > e'-^'-^-'^^''\u\ for x G S-'^D^, u G V_(^^_r+2){x) 

and if a; G S~''D^. n D^. this is impossible because A, — 1 > Xr if also u G Vr{x) (so that |Tj.(a;)«| < e^'^'^~^^'^\u\) 
and d is smaller than the twice the minimal difference between the scaling exponents.) 

(30) The planes Wj{x) = Vj{x) n y_(3_j_|_i)(a;) are such that 

i log \T±k {x)u\ fe^^> Xj for u G Wj (x) 

and Vj =Wj @ Ws, V_^^_j+2) = Wi © • . • ® . Also T{x)Wj{x) = Wj{Sx). This also implies the 

relation between the dynamical bases claimed in the theorem II. (Idea: this is just a corollary, or a summary, of 
the previous problems.) 

(31) Show that if //, is non invariant the foward and backward Lyapunov exponents (and the relative systems 
of planes) may exist and be different. (Idea: consider the example on C = [— 1,1] x T^, where is the 
two dimensional torus. Let x = {z,tfi,<f2) be a point in C. Let z — » /(z) be a smooth map such that 
/"(^) „^±oo' =•= 1' ^"'i '^(•^) = 2 if z > and 1/(2) = 1 if z < 0; define: 



x' = [z! ,ip\^'2) = s V'l + '^(2)¥'2 mod27r 

L i^(z)¥'i + (i^(z)^ + l)iyJ2 mod27r 

and check the statements using = dzdipi dip2/2(2iT)'^ .) 

(32) Find the positive and negative Lyapunov exponents in the example suggested in problem (31); find also 
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the dynamical bases (of the points which admit them). 
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